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A new graph coloring algorithm is presented and compared to a wide variety of known algorithms. The 
algorithm is shown to exhibit 0(n 2 ) time behavior for most sparse graphs and thus is found to be particularly well 
suited for use with large-scale scheduling problems. In addition, a procedure for generating large random test 
graphs with known chromatic number is presented and is used to evaluate heuristically the capabilities of the 
algorithms discussed. 
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1 . Introduction 

Graph coloring has considerable application to a large variety of complex problems involving optimization. 
In particular conflict resolution, or the optimal partitioning of mutually exclusive events, can often be 
accomplished by means of graph coloring. Examples of such problems include: the scheduling of exams in the 
smallest number of time periods such that no individual is required to participate in two exams simultaneously 
(see appendix A), the storage of chemicals on the minimum number of shelves such that no two mutually 
dangerous chemicals (i.e., dangerous when one is in the presence of the other) are stored on the same shelf, 
and the pairing of individuals (as in a computer dating agency) such that the maximal number of compatible 
persons are paired together. 

In each of the above problems, the constraints are usually expressible in the form of pairs of incompatible 
objects (e.g., pairs of chemicals that cannot be stored on the same shelf). Such incompatibilities are usefully 
embodied through the structure of a graph. Each object is represented by a node and each incompatibility is 
represented by an edge joining the two nodes. A coloring of this graph is then simply a partitioning of the 
objects into blocks (or colors) such that no two incompatible objects end up in the same block. Thus, optimal 
solutions to such problems may be found by determining minimal colorings for the corresponding graphs. 
Unfortunately, this may not always be accomplishable in a reasonable amount of time. 

As the graph coloring problem is known to be NP-complete [l], 1 there is no known algorithm which, for 
every graph, will optimally color the nodes of the graph in a time bounded by a polynomial in the number of 
nodes. Since exponential time algorithms [5, 6, 7, 9, 18] are prohibitively expensive for use with large-scale 
problems, much attention has been focused on the development of heuristic algorithms which will usually 
produce a good, though not necessarily optimal, coloring for any graph in a reasonable amount of time. 

This paper describes a new graph coloring algorithm, the recursive largest first (RLF) coloring algorithm. 
In addition, a variety of existing coloring procedures are presented and their performance on a wide range of 
test data is compared to that of the RLF algorithm. 

Also described is a procedure for generating random graphs with known chromatic number. The existence 
of such a procedure, heretofore lacking in the experimental literature, provides a standard method for testing 
the accuracy of graph coloring algorithms. 
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2. Preliminary Definitions 

Throughout this paper, the graph G with nodes V and edges £", denoted by (V, E\ is assumed to contain no 
loops or multiple edges. The subgraph of G = (F, E) induced by a subset U of the nodes V consists of those 
nodes and all the edges that directly connect them. This subgraph is represented by < U > or (U, E') where 
E' = {(z^ 1? w 2 )\{w l , w^)eE, w l eU, w 2 eU}. The degree of a node weG, denoted by d(w), is the number of 
nodes adjacent to w in G. Define d v (w) to be the number of nodes in U adjacent to w in G. This is equivalent 
to the degree of w in < U U {w} > . 

A coloring of G is an assignment of colors to the nodes of G such that no two adjacent nodes share the same 
color. More formally, a k-coloring of G is a mapping/: V ^ {l, 2, . . . , A;} such that f(u) = f(v) only if (u, 
v) £ E. The chromatic number of G, denoted x(^X 1S me minimal number of colors necessary to color G. An 
optimal coloring of G is one which uses exactly x(^) colors. 

3. Sequential Coloring Algorithms 

One of the simplest coloring algorithms is the randomly ordered sequential (RND) graph coloring algorithm 
[14]. Given a graph G — (V, E), the algorithm randomly orders the nodes so that V = {i>i, . . . , v n } and then 
assigns colors to the nodes in the following manner. The first node, v x , is assigned color number 1. Once the 
first i nodes have been colored (1 < i < n — 1), v i+1 is assigned the lowest possible color number such that 
no previously colored node adjacent to v i+1 has been assigned the same color number. 

Though this algorithm is locally optimal in the sense that each node is assigned the smallest possible 
number, the overall action is highly dependent on the initial ordering of the nodes. For any graph, there exists 
an ordering for which this algorithm will produce an optimal coloring [14], while a less fortuitous ordering 
may lead to an extremely poor coloring. Thus the problem of finding an optimal initial ordering of the nodes 
is equivalent to the problem of optimally coloring the graph. 

This fact has led to the development of a large number of algorithms, each differing from RND only in the 
method of initially ordering the nodes [7, 14]. Two such algorithms are the largest first (LF) and smallest last 
(SL) sequential coloring algorithms. 

The LF algorithm orders the nodes such that d(vf) ^ d{v i+1 ) for 1 <i<n where V = {v\ , . . . , v n }. The 
SL algorithm is similar in strategy but recursively orders the smallest degree nodes last. An SL ordering is one 
in which d(v n ) = min d{iv) and for n — 1 > i > 1, du(v^ = min d v (w) where U = V — {v n , . . . , v i+1 }. 

weV wtU 

Note that both the LF and SL algorithms tend to order the high degree nodes before the low degree nodes. 
Computational experience has shown that this is generally a good strategy, whereas algorithms which color the 
higher degree nodes last have often been found to produce colorings worse than those produced by a random 
ordering. 

Each of the sequential coloring algorithms presented in this section requires 0(n 2 ) time and 0(n 2 ) space to 
color a graph with n nodes. Quadratic time and space complexities are generally quite acceptable for use with 
large-scale coloring problems. If only they gave guaranteed optimal colorings, we would look no further. 

4. More Sophisticated Algorithms 

One successful variation of the sequential coloring algorithms involves what is known as an interchange. 
Given any G = (V, E) and color function/ such that f(w) e{i, j} for all weV, and (i, j)-interchange on G is a 
redefinition of/ such that if f(w) = /originally, f{iv) is now assignedy and vice versa for all weV. 

Appropriate use of the interchange process has been found to yield particularly good results when used in 
conjunction with the LF and SL algorithms [14]. The resulting procedures are referred to as the smallest last 
with interchange (SLI) and largest first with interchange (LFI) coloring algorithms. 

The SLI (LFI) algorithm operates just like the SL (LF) algorithm except when the latter requires the 
introduction of a new color. Suppose that such a situation occurs when v m is the node to be colored and that 
k = max f{vi). For 1 < i < j < A:, define G i} to be the subgraph of G induced by the nodes of G previously 

i<m 

colored i or j. If possible, choose i andy such that no connected component of G^ contains two differently 
colored nodes both adjacent to v m . If such a G^is found to exist, then perform an (i,j) — interchange on each 
connected component of G^ ; which contains an t-colored node adjacent to v m in G. It is now possible to assign 
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color i to v m and thus the addition of a new color has been avoided. If no such G-^ exists, however, then 
regardless of what interchange is performed, v m must be assigned color A: + 1. 

This version of the SLI (LFI) algorithm initially appeared in [11] and is an extension of the original version 
which is described in [14]. The original version allows an (j, /) — interchange only when v m is adjacent to 
exactly one node colored i and one node colored^. There is little difference between the original and extended 
versions of the SLI and LFI algorithms in terms of colorings produced or time required. While the extended 
versions may be able to perform a useful interchange impossible in the original version, they will likely take 
slightly longer to do so. All four algorithms require 0(n 3 ) time and ()(n 2 ) space to color an n node graph. 
Based on a limited amount of computational experience, the extended version of the SLI algorithm (henceforth 
to be referred to simply as the SLI algorithm) was found to produce slightly better results than did the other 
interchange procedures. 

All of the algorithms thus far presented are capable of producing very bad colorings, in terms of number of 
colors used, for certain graphs. Johnson [10, 11] has given constructions of 3-colorable graphs on ()(n) 
vertices which each of the above algorithms requires n colors to color completely. Since no more than (){n) 
colors may be used to color an (){n) node graph, such colorings are, up to a constant, the worst possible. 

There is an algorithm, however, which will color any graph G with n nodes in 01 J x(^) or fewer 

\log n) 

colors. While this worst-case behavior is still unacceptable in practice, the approximately maximum 

independent set (AMIS) algorithm is interesting because it is the only known algorithm which is known not to 

exhibit the worst possible worst-case behavior [11]. The algorithm proceeds as follows. Given G = (V, E), 

select the node with minimum degree in G, say V\ , and color it 1. Once i nodes have been assigned color 1, 

select, if possible v /+] ec/such that du(v i+ ]) is minimal for nodes in U where U is the set of uncolored nodes 

not adjacent to any colored node. If no such selection is possible, i.e., U is empty, then repeat the entire 

procedure on the subgraph of G induced by the uncolored nodes of G, using the next available color. This 

process is then, in turn, repeated until all the nodes of G have been colored. 

Interestingly enough, while this algorithm exhibits better worst-case behavior than the other algorithms 

thus far discussed, computational experience has shown that, on the average, the colorings it produces are 

substantially inferior to those produced by the LF, SL, and SLI algorithms. 

5. The Recursive Largest First (RLF) Algorithm 

The RLF algorithm combines the strategy of the LF algorithm with the structure of the AMIS algorithm. 
Like the LF algorithm, at each step in the RLF procedure a node is selected for coloring which will, in some 
sense, leave the resulting uncolored nodes colorable in as few colors as possible. As with the AMIS algorithm, 
the RLF procedure completes the assignment of color i before commencing assignment of color i + 1. 

The RLF graph coloring algorithm proceeds as follows. Given G = (V, E\ assign color 1 to the node with 
maximal degree in G, say v x . Once i nodes have been assigned color 1, select, if possible, v i+1 eU } such that 
du 2 ( v i+i) * s m aximal for nodes in U 1 where V \ is the set of uncolored nodes not adjacent to any colored node 
and U2 is the set of uncolored nodes adjacent to at least one colored node. Ties are, if possible, broken by 
choosing the node that has minimal degree in < U\ > . If no such selection is possible, i.e., U\ is empty, 
then repeat the entire process recursively on the subgraph of G induced by the uncolored nodes of G, using 
the next available color. This recursion is then repeated until all of the nodes in G are colored. Several 
examples of this procedure are worked out in appendix A. 

As was true with the SLI algorithm, the RLF algorithm, in general, requires ()(n :i ) time and (){n 2 ) space to 
color an n node graph. Unlike the SLI algorithm, however, the RLF algorithm requires only (){n 2 ) time to 
color graphs for which k- e ~ n 2 where k is the number of colors used to color the graph, e is the number of 
edges in the graph, and n is the number of nodes in the graph (see appendix B for proof). Such graphs, which 
are usually sparse, quite commonly arise in practical applications such as exam scheduling. For example, the 
graph associated with the 1977-8 Princeton University fall term course examinations schedule consisted of 
273 nodes, 6727 edges, and required 17 colors to be colored by the RLF algorithm. Thus, for practical 
purposes, the RLF algorithm, if programmed properly, exhibits an 0(n 2 ) time dependence for many 
applications. Appendix B presents a PL-1 listing of the RLF algorithm as well as a rigorous analysis of its 
time complexity. 
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6. Generation of Test Graphs With Known Chromatic Number 

A few papers have been published which compare the performance of various algorithms on large (usually 
100-node) randomly generated graphs [14, 21, 23]. Unfortunately, none of these empirical studies provide 
the chromatic numbers of the test graphs used. Indeed, the task of closely approximating the chromatic 
number of a graph is NP-complete [8] and thus virtually impossible to accomplish for large graphs. 
Consequently, approximations of upper and lower bound results established for x(^) have generally been 
crude and of little practical use [1, 7, 14, 19]. 

The lack of such information makes an accurate interpretation of the experimental data very difficult. For 
instance, if algorithm A required 22 colors to color G while algorithm B required only 20, the conclusions 
drawn about their relative effectiveness if x(^) = 20 might be quite different from those drawn if x(£) = 4. 
Further, without knowledge of x(^)> no statement can be made at all about the accuracy or closeness to 
optimality of either algorithm A or B. Thus there is a need for a standard procedure for generating random test 
graphs with known chromatic numbers. Such a procedure will now be presented. 

Suppose it is desired to construct an Ai-node graph G with e edges and chromatic number k. For the 
purposes of the following argument, assume that k\ru This is not a significant restriction since most test or 
modeling uses of a large graph generator are likely to allow some flexibility in the choices of n and k. For such 
a graph to exist under these restrictions, e must be such that 

k(k- 1) n 2 (k - 1) 
< e f 



= MODfa^j + c, m) where MOD(Z, Y) = X 



* F. Sequences generated in such a manner exhibit 



2k 

The first step in the procedure is to choose positive integers a, c and m such that: 

1. m » n, 

2. (n, m) = k, 

3. (c, m) = 1, 

4. p\m—>p\(a— 1) for all primes p, and 

5. 4|m^ 4|(a - 1). 

Next generate a uniform sequence of random numbers {X t } on the interval to m — 1 by the linear 
congruential method described in [12]. This is accomplished by fixing X and, then for each i > 0, setting X f 

" X 
Y_ 

two important properties [12]. First, for every i and j such that < j < m — 1 and i > 0, there exists an r 
such that i < r < i + m — 1 and X r = j. Second, for every i > 0, X t = X i+m . 

Next construct the sequence {Fj on the interval to n — 1 so that Y\ = MOD(^-, n). Note that unless k 
= n, n\ m and { F;} is not a uniform random number sequence on the interval to n — 1. 

By defining V = \0, 1, . . . , n — l}, it is possible to associate two consecutive values of { F ? } with edges 
to be added to E. Similarly, it is possible to associate h consecutive values of {Fj with /i-cliques to 
be implanted in G. For example, the subsequence {Y A , F 2 , F 3 } corresponds to the subset 
{{v Y , Vy )•> ( v y » v Y»)i ( V Y » v y )}« By identifying certain subsequences of consecutive elements of { F ? } and 
adding the corresponding edges toE, it is possible to construct the desired graph G. 

More precisely, define the (A; — l)-vectorb = (b k , b k _ 1 , . . . , b 2 ) so that b k > 1 and b t > for 2 < i < 
/c — 1. Each b t corresponds to the number of i-cliques to be implanted in G. Specifically, given the sequence 
{Fj} and vector b, proceed as follows. Select the first k values of {Fj starting with Y x and add the 
corresponding edges to E. If b k > 1, select the next k values of { F,-} and add the corresponding edges to E . 
Repeat this process until 6 fc A:-cliques have been implanted in G. Next add, in an identical fashion, b k _ 1 
(k — l)-cliques to G. Continue the process until b 2 2-cliques or edges have been added to E. Note that some 
edges may be "added" several times and thus it may not be possible to precalculate a vector b such that there 
are exactly e edges in the resulting graph. It is possible, however, to keep track of how many edges have been 
added at any point and to eliminate the addition of i-cliques which might result in the addition of too many 
edges to E. Since edges may be added one at a time, it is not difficult to show that graphs having exactly e 
edges may be constructed in this manner for any e such that 
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k(k- 1) n 2 (k - 1) 

< e < . 

2 2A: 

It now remains to be shown that x(£) = ^ f° r an Y ^ constructed in this manner. Since b k > 1, G contains 
a A>clique and thus x(^) — k- Before establishing that x(^) — ^ ^ * s useful to examine the structure of the 
sequence { Y( } where Y / = MOD( Y t , A:). Since k | n and k \ ra, 

Y' i+1 =MOV(Y i+1 ,k) 

= MOD(MOD(* /+1 , n),*) 

= MOD(Z i+1 , k) 

= MODtMOD^, + c, m), k) 

= MQDiaX; + c, A) 

= MOD(MOD(f/^- + c, n), A) 

= MODiaY; + c, t) 
Y' M = MODiaY/ + c, fc). 

Further, 

p| A- — > />|//j — » p| (a — 1 ), 
4. | A; — > 4f\m — > 4|(a - 1). and 
(c, m) = I -> (c, A) = 1. 

Thus { Yj'} is a uniform sequence of random numbers on the interval to A- — 1 . 

This structure of the { Yj} modulo k allows the following coloring of G. For each i 9 define /{vy) = MOD(i, 
k). Since for ally such that < j < n^ there exists an i > such that Y t = j, it is clear that every node is 
assigned a color by this procedure. Since { Fj is a uniform sequence of random numbers on the interval from 
to k - 1, we know that if Y { = Y 3 , then Y/ = Y- and MOD(Z, A) = MOD( /. It) and thus that f(v Y .) = 
f(v Y ). This means that/ is well defined. Finally, it is easily verified that v Yr v Y , , . • - , v Y i+h _ are al l 
colored differently if h < A;, for all i > 0. This means that edges occur only between differently colored nodes, 
and that/ is a proper coloring of G . Thus x(^) = &• 

It should be noted that the above result is a special case of a more general result for arbitrary k and n. That 
is, if A; and n are such that k < d where d = {n, m), then k < x(^) — A: + MOD(<i, A) < 2A:. The proof of the 
general result is not given here but is similar to that of the special case when MOD (J, k) — 0. 

As will be demonstrated shortly, the range of graphs which can be generated by this procedure is quite 

large. The node degrees of such graphs may vary between and n while the average node degree may 

k 
fctfc — \\ k — \ 

vary between and n. The variety of distributions of node degrees is also quite large. Most 

n k 

importantly, however, the procedure generates graphs which are as difficult to color as are randomly generated 
graphs (where the chromatic number is not known). Demonstration of this fact is provided in section 7. 

Another advantage of this procedure is that the test graphs may be easily characterized. For example, only 
k + 5 values are required to generate an n-node graph with chromatic number A:. These values are n, k, Xo , 
a, c, m, b k , b k - x , . . . , 6 3 and b 2 . Whereas it would be infeasible to completely describe a large, randomly 
generated graph by conventional means in a short paper, graphs generated by this procedure are easily 
described. Thus, in future publications concerning the effectiveness of various graph coloring algorithms, it 
will be possible to specify precisely which graphs were used to test the various algorithms. There are several 
conceivable situations where such documentation could be valuable to the interested reader. For example, 
should the reader desire to compare the effectiveness of a new graph coloring algorithm to those in the 
literature, he would need only to regenerate the graphs used in published tests and color them with the new 
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algorithm. This would eliminate the necessity of developing an entirely new set of test data and of having to 
rerun all previous algorithms on such data. Pursuant to these goals, a complete characterization of the test 
graphs referred to in tables 1 and 2 is provided in appendix C. 

7. Test Results 



The procedure described above was used to generate 27 150-node graphs and 12 450-node graphs of 
varying edge density and chromatic number. In addition, 27 completely random 150-node graphs were 
generated with varying edge density and unknown chromatic number. The RND, LF, SL, RLF and SLI 
algorithms were tested on each of the 66 graphs. The resulting data are displayed in tables 1, 2, and 3, 
respectively. 

In each table, the graphs are subdivided into groups according to chromatic number, ^, (or as in the case 
with the completely random graphs, to a known lower bound for \) an d average node degree, d. There are 

Table I. Results for 150-node test graphs generated according to the procedure detailed in section 6. 



X d 


number of colors used 


average number of excess colors used 


RND 


LF 


SL 


RLF 


SLI 


RND 


LF 


SL 


RLF 


SLI 


5 11 
19 

24 

Ave. total 


(9, 9, 9) 
(11, 11, 12) 
(13, 12, 13) 


(7, 8, 7) 

(10, 10, 9) 

(10, 11, 11) 


(7, 7, 8) 

(9, 10, 9) 

(10, 9, 10) 


(6, 6, 6) 

(7, 7, 8) 
(7, 6, 7) 


(6, 6, 6) 
(9, 5, 8) 

(8, 7, 6) 


4 
6V3 

7 2 /3 

6 


2Va 

4 2 /3 
5 2 /3 

42/9" 


2V 3 
4Va 

4 2 /3 
3^/9~ 


1 
2Va 

l 2 /3 

Vli 


1 
2Va 

2 

TV9* 


10 11 

21 

29 

Ave. total 


(11, 11, 12) 
(15, 14, 16) 
(17, 16, 18) 


(10, 10, 10) 
(12, 12, 12) 
(14, 14, 14) 


(10, 10, 10) 
(12, 12, 12) 
(14, 15, 15) 


(10, 10, 10) 

(11, 11, 11) 

(13, 13, 12) 


(10, 10, 10) 
(11, 11, ID 
(13, 13, 12) 


IV3 
5 

7„ 
4 4 / 9 



2 
4 

2 



2 
4 2 /a 

2 2 /9 




1 

2 2 /3 
l 2 /9 



1 

2 2 /3 
l 2 /9 


15 12 
25 
34 

Ave. total 


(15, 16, 15) 
(19, 18, 18) 
(19, 21, 20) 


(15, 15, 15) 
(17, 16, 16) 
(17, 17, 18) 


(15, 15, 15) 
(16, 16, 15) 
(17, 19, 17) 


(15, 15, 15) 
(15, 15, 15) 
(16, 16, 16) 


(15, 15, 15) 
(15, 15, 15) 
(16, 16, 16) 


Va 
3V3 
5 

2 8 /9 




lVs 

2Va 

P/9 




2 /3 

2 2 /3 

1V8 




1 




1 

^3" 


Overall 
average 
total 

Total 
time 
(seconds) 


10.2 


13.3 


13.5 


15.3 


49.2 


4 12 /27 


2 13 /27 


2 10 /27 


l 2 /27 


l 3 /27 



Table 2. Results for 450-node test graphs generated according to the procedure detailed in section 6. 



X d 




num 


ber of colors used 




average number of excess colors used 


RND 


LF 


SL 


RLF 


SLI 


RND 


LF 


SL 


RLF 


SLI 


5 25 
43 


(14, 13) 
(17, 18) 


(11,12) 
(12, 14) 


(11, 12) 
(11, 15) 


(8,8) 

(5, 5) 


(10,9) 
(5,5) 


8V2 
12V2 


6V2 
8 


6V2 
8 


3 




4Vi 



Ave. total 












IOV2 


7V4 


7V4 


IV2 


2V4 


15 36 

74 


(22, 22) 
(30, 31) 


(18, 18) 
(26, 26) 


(18, 18) 
(26, 26) 


(17, 16) 
(23, 23) 


(16, 16) 
(23, 24) 


7 
15V2 


3 
11 


3 
11 


IV2 
8 


1 
8V2 


Ave. total 












11V4 


7 


7 


4 3 /4 


4 3 /4 


25 37 

77 


(29, 27) 
(37, 35) 


(26, 25) 
(29, 30) 


(25, 25) 
(31, 31) 


(25, 25) 
(28, 28) 


(25, 25) 
(28, 29) 


3 
11 


Va 
4Va 



6 



3 



3Vi 


Ave. total 












7 


2V2 


3 


IV2 


l 3 /4 


Overall 












9 7 /l2 


5 7 /l2 


5 9 /l2 


2 7 /l2 


2"/ia 


average 
total 
Total 


32.0 


42.8 


44.9 


80.7 


308.8 












time 
(seconds) 
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Table 3. Results for random ISO-node test graphs. 




Lower 
Bound 
on x 


d 


RND 


LF 


SL 


RLF 


SL1 


5 


10 
18 
24 


(9, 9, 8) 
(11, 11, ID 
(13, 13, 12) 


(7, 7, 7) 
(10, 10, 10) 
(10, 12, 11) 


(7, 7, 7) 
(10, 10, 0) 

(11, 11, 11) 


(6, 6, 6) 
(8, 8, 8) 
(9, 9, 9) 


(6, 6, 6) 

(9, 8, 8) 

(10, 10, 10) 


10 


10 
18 
24 


(16, 15, 16) 
(16, 17, 16) 
(17, 18, 17) 


(15, 15, 15) 
(15, 15, 15) 
(15, 16, 16) 


(15, 15, 15) 
(15, 15, 15) 
(15, 15, 15) 


(15, 15, 15) 
(15, 15, 15) 
(15, 15, 15) 


(15, 15, 15) 
(15, 15, 15) 
(15, 15, 15) 


15 


8 

16 
24 


(24, 22, 21) 
(23, 23, 24) 
(24, 24, 24) 


(24, 22, 21) 
(23, 23, 24) 
(24, 24, 24) 


(24, 22, 21) 
(23, 23, 24) 
(24, 24, 24) 


(24, 22, 21) 
(23, 23, 24) 
(24, 24, 24) 


(24, 22, 21) 
(23, 23, 24) 

(24, 24, 24) 


Total time 
(seconds) 


9.8 


12.9 


12.9 


19.0 


384.3 



three graphs in each 150-node group and two graphs in each 450-node group. The numbers in parentheses 
indicate the number of colors used by an algorithm to color the first, second, and possibly, third graph of that 
category. For the graphs where x was known, the average number of excess colors used by the algorithm was 
computed for each group and totaled. For example, the RLF algorithm optimally colored each of the 10- 
colorable 150-node graphs with average degree 11 in table 1 but required, on the average, 4 3 A extra colors to 
color each of the four 15-colorable 450-node graphs in table 2. 

The total run time for each algorithm is also included in each table. This figure represents execution time 
in seconds on an IBM 360-91. It should be noted that such figures are highly dependent on factors unrelated 
to the inherent efficiency of the algorithm, such as programmer skill and machine characteristics. The time 
complexity estimates provided earlier are much more rigorous measures of the algorithms' relative speeds. 
Except for the SLI time in table 3, the run times are in accordance with these theoretical estimates. It is quite 
possible that the graphs referenced in table 3 have chromatic numbers much higher than the minimum 
estimate, and that the SLI algorithm was thus induced to attempt and possibly perform a large number of 
time-consuming interchanges. This example points out the highly variable amounts of time required by most 
interchange algorithms to color various graphs (a phenomenon also observable in the data of [14] ). 

The random graphs of table 3 were included only for the purpose of demonstrating that the graphs generated 
by the technique discussed in section 6 are just as suitable for testing the relative capabilities of graph 
coloring algorithms as are completely randomly generated graphs. As was pointed out earlier, the data in 
table 3 cannot be used to draw conclusions about the accuracy of the tested algorithms. From the data in 
tables 1 and 2, however, we observe that, for the graphs considered, the LF and SL algorithms required about 
twice as many extra colors to color the graphs as did the RLF and SLI algorithms. Similarly, the RND 
algorithm required about twice as many extra colors as did the LF and SL algorithms. Significantly, this 
observation can be made for most of the graphs on an individual basis. The RND algorithm always used more 
colors than the LF and SL algorithms which, in turn, always used more colors than the RLF or SLI algorithms. 

There is not as clear a distinction between the performance of the LF and SL algorithms or the RLF and SLI 
algorithms. The LF and SL algorithms required virtually the same number of colors on the average and 
required nearly the same amount of time. While the colorings produced by both the RLF and SLI algorithms 
for test graphs were, on the average, quite good, the RLF algorithm required substantially less time and used 
approximately 12 percent fewer excess colors on the 450-node graphs and 3 percent fewer excess colors on 
the 150-node graphs than did the SLI algorithm. Of the eight 450-node graphs which were not optimally 
colored by both the RLF and SLI algorithms, the RLF algorithm required the fewest colors for four of the 
graphs and the most for only one graph. 

As a final note, the edge density, - , of the test graphs did not exceed V4. This results from the fact that for 

most large-scale practical applications, the edge density of the graphs to be colored is generally small. For 
instance, the Princeton University exam scheduling graph mentioned in Section 5 had an edge density of 
approximately 1 /6. 
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8. Conclusions 

From the data presented it is apparent that the RLF algorithm, when not optimal, colored large graphs with 
substantially fewer colors than did any of the other algorithms that did not involve interchanges. When 
compared with interchange algorithms, the RLF algorithm was found to produce slightly better colorings in 
substantially less time. While the RLF and interchange algorithms in general each require 0{n 3 ) time to 
color an n-node graph, the RLF procedure is unique in that it exhibits 0{n 2 ) time behavior for graphs with 
low edge density. Thus the RLF algorithm is particularly well suited for use with large-scale practical 
problems. 

The method described in section 6, for generating random graphs with a known chromatic number, was 
found to produce test data which can be used to determine heuristically a given algorithm's accuracy as well 
as algorithms' relative capabilities. Previously, published comparison tests have been made only on graphs 
with unknown chromatic numbers, which rendered impossible any evaluation of an individual algorithm's 
accuracy and questionable any statement about two algorithms' relative capabilities. In addition, the 
procedure provides a standard method of generating test data for coloring algorithms; by its use a large graph 
with known chromatic number may be uniquely constructed from only a few parameters. 



In addition to Professor Acton, the author would like to thank Dr. Charles Johnson, Dr. James Lawrence, 
and Dr. Alan Goldman for their helpful remarks. 
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10. Appendix A: Application to Examination Scheduling 

The examination scheduling problem is probably the best known of a large class of scheduling problems in 
applied mathematics and operations research. It consists of scheduling exams such that no individual is 
required to participate in two or more exams simultaneously. It is usually assumed desirable to schedule the 
exams such that the total number of time periods required for the examinations is minimized. Sometimes, 
additional restraints are imposed. Requiring that some examinations be given or not given in specified time 
periods and scheduling the exams so that a certain subset of the exams will be completed as early as is 
possible are examples of such restraints. 

Consider the following exam scheduling problem. 



Exam 


Participant 


s 


Exam 


Participants 


1 


A X ,A 2 , A 3 , 


A 6 


7 


A 4 , Aq 


2 


#1, #2, B 3 




8 


B 3 , # 6 , B 7 


3 


B 2 ,B 4 , 




9 


£>5 , />7 , n# 


4 


/l ] ; /»4 . t* 4 




10 


/4,, #,, B 4 


5 


«,.«:, 




1 1 


# 3 , B 7 


6 


A 2 ,A.,.I} { 


Figure 


12 
1 


A 3 ,A 5 



Time Peru 
1 
3 
5 



Exams that may not 

be s cheduled 

4, 11, 12 

3, 7 

1, 10 



Figure 2 



In addition to the information contained in figures 1 and 2, assume that we also know that exam 2 must be 
scheduled in time period 1 and that the final schedule must be such that the last exam involving a participant 
of type A is scheduled as early as possible. 

We will now proceed to solve the above scheduling problem utilizing the RLF graph coloring algorithm. 

Since exam 2 must be scheduled in time period 1, we will do so and amend figure 2 so that exams 
incompatible (i.e., may not be scheduled concurrently) with exam 2 will not be scheduled in time period 1. 
This information is included in figure 3. 



Time Period 
1 
3 
5 



Exams that may not 

be scheduled 

3, 4, 5, 6, 8, 10, 11, 12 

3, 7 

1, 10 



Figure 3 



The restriction placed on exams involving type A participants may be satisfied, as far as is possible by 
heuristic means, by scheduling the exams involving type A individuals first and then, using this information, 
scheduling the remaining exams. The graph in figure 4 contains the information necessary for the first step. 

Node E\ [represents exam i and node Fj represents time period j for all i, j. There is an edge between every 
pair of time period nodes to insure that no two time periods are assigned the same color. An edge is inserted 
between node E t and node Ej if and only if exams i and j may not be scheduled simultaneously. Finally, an 
edge is inserted between node E\ and node T ] if and only if exam i may not be given during time period j. 
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Figure 4 
To obtain the desired s h ? 1 

5' ^4, /s 6 , ^io, and 




Figure 5 



Only nodes £, and £ 7 remain in f/ ^ r • 
only two nodes in U 2 . Thus node F ' , , ' ' " adjacent to «ve nodes in II U-, , 

--led (ff, ls e^ a„ d h'te ' ' S ^ ' ^ " 0de *' ™^- ThTs 1 av/ ^ * 7 " adjaCent «° 

Procedure on the Siting Iraoh ! I ^T ^ C ° ] ° red "^es fl- ^ J* £ ^ "° de «*« colored or 
— crcled, we have ^ S&? ^ ' ^ * ^^^S^ ~ 




Figure 6 



Both nodes E 7 and E i0 are adjacent to one node in U 2 while all other nodes in V \ are not adjacent to any 
node in £/ 2 • Since E 7 is connected to only one node in U x while d v (E ]0 ) = 2, E 7 is colored 2 and E 4 is 
circled. This leaves the graph in figure 7. 




Figure 7 



Completing the assignment of color 2, E 10 is assigned color 2, node £ 6 is circled and, finally, E 12 is colored 
2. Since t/j is now empty, we delete the colored nodes and repeat the process on the graph in figure 8 using 
color 3. 




Figure 8 
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This graph is trivially colored by assigning color 3 to nodes T 3 , E 4 and E 6 ; color 4 to T 4 ; and color 5 to T 5 . 

The schedule may now be constructed by assigning to each time period those exams which were assigned 
the color of that time period. Should the number of colors used exceed the number of time period nodes used, 
the additional colors may be arbitrarily associated with additional time periods (assuming they exist). In this 
example, however, the number of time period nodes, 5, exceeded the number of colors used, 3, and no such 
additional assignment of time periods was necessary. The resulting schedule is displayed in figure 9. 



Time Period 


Exams 


1 


1 


2 


7, 10, 12 


3 


4,6 


4 




5 





Figure 9 



This completes the scheduling of exams involving type A individuals. We must now schedule the remaining 
exams taking into account the partial schedule in figure 9 and the information displayed in figure 3. This 
information is summarized in figure 10. 



Exams that may not 
Time Period be scheduled 

1 3, 5, 8, 11 

2 3,5 

3 3,5 

Figure 10 



Combining this information with that in figure 1, the graph in figure 11 is readily constructed. 
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All that remains is to color the graph displayed in Figure 11. This is easily done using the RLF algorithm. 
The final coloring is displayed in figure 12. 




Figure 12 



Note that, depending on the order in which the nodes were considered, En could have been assigned color 
2 and E 8 color 3 since the two nodes are identical as far as the RLF algorithm is concerned. This illustrates 
the fact that the final colorings assigned to a graph may, to a small extent, depend on the initial ordering of 
the nodes (i.e., on the manner in which nodes with identical characteristics are distinguished). Finally, since 
4 colors were necessary to color the graph, those exams colored 4 will be scheduled in the first available, 
unused time period, the fourth time period. 

This results in the partial schedule in figure 13. 



Time Period 




Exam 


1 




9 


2 




8 


3 




11 


4 




3, 5 




Figure 


13 



The completed exam schedule for all exams is displayed in figure 14. In this particular case, the schedule 
produced is optimal. 



Time Period 


Exam 


1 




1, 2, 9 


2 




7, 8, 10, 12 


3 




4, 6, 11 


4 




3, 5 

Figure 14 
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1 1 . Appendix B: Computer Implementation of the RLF Algorithm 



As was demonstrated in Appendix A, the RLF algorithm can be used to color small graphs easily by hand. 
For large graphs, however, such a task can only be accomplished reliably by computer. In order to program 
the RLF algorithm efficiently, the values of d v and d v must be stored for each node and updated whenever 
JJ \ or U 2 is modified. This is accomplished by defining two arrays, E and F : 



I<0 if w is colored 1 
<0 if weU 2 | . 

= d Vl (w) if well,) 

I<0 if w is colored 
= d^u^U) if n eU 2 \ 
= d UlUU2 (w) if weU x ) 



Initially U x consists of every node in G and thus E(w) = F( w) = d{ w) for all weV. If node w ' is selected 
for coloring, then we must modify E and F so that: 

E(w')*--1, 

E(w) « 1 for w adjacent to w ' , 

E(w) *— E(w) — (the number of nodes in U 1 adjacent to both w and w') 

for iveU x and nonadjacent to w\ 
F{w')<r--1, and 
F{w) *—F(w) — 1 for w adjacent to w' . 

The next node to be colored will, of the nodes in U x , then have the maximum value of F(w) — E(w) and, 
if a tie exists, the minimum value of E(w) among those nodes that tied. When U 1 is empty (this corresponds 
to E(w) < for all weV), the values of E are modified so that E(w) <— F(w) for all weV. This corresponds 
to a reinitialization of the values of E for the subgraph of G induced by the uncolored nodes. 

The above operations on E and F can be easily performed by appropriate use of the following subroutine, 
procedure DELETE. Given an array D and node w ', DELETE performs the following operations on D : 

D{u') *- -1 

D(w) <— D(w) — 1 for w adjacent to w f . 



Thus whenever node w ' is selected for coloring we may modify E and F by simply performing DELETE on 
(F, w' ) and (£\ w' ) as well as on (E, w) for all w in l7j adjacent to w ' . It is not difficult to verify that such a 
procedure maintains the desired values ofE andF. 

A complete PL-1 computer program listing of the RLF procedure is included at the end of this appendix. 
The program is written in subroutine form and assumes that values for CI and CL are provided on input. Array 
CI serves as an index array to CL, the node adjacency list. For example, the nodes adjacent to the ith node 
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are sequentially stored in CL(CI(i — 1 ) + 1), CL(CI{i — 1) + 2) CL(CI(i)). The data are stored in 

this compact form to minimize the amount of storage required, a very important consideration when working 
with large graphs. 

Each node w in the graph is processed by procedure DELETE, i.e. deleted, /(/<) times where /(/c) is the 
color number assigned to w. This claim is easily established by observing that for each new color/ introduced, 
i < f(w), w is either colored i or adjacent to a node colored i. In either case w is deleted exactly once. Once 
w is colored, then it may never subsequently be deleted. Thus exactly 



2j irij < Zj k n i ~ k : ' 2j n i ~ n ^ 



deletions are performed on G, where k is the number of colors used to color G, tij is the number of nodes 
colored i and n is the number of nodes in G. Since each deletion requires (){d) time where d is the average 
degree of a node, all the deletions may be accomplished in O(kdn) time. It is easily checked that all other 
operations may be accomplished in (){n 2 ) time. Thus the algorithm requires 0(n 3 )time and (){n 2 ) space to 
color an arbitrary n node graph. However, for those graphs where kd — 0(n) or, equivalently, kc = ()(n 2 ). 
the RLF algorithm consumes only ()(n 2 ) time and (){n 2 ) space in coloring the graph. As was pointed out in 
the text, many large-scale practical problems involve graphs for which this property holds and thus may be 
colored with the RLF algorithm in ()(n 2 ) time. 
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ELF: PROCEDURE CN,CI , CL, COL, C , E , F ) ; 

/# */ 

/* THIS SUBROUTINE COLORS THE N NODE GRAPH DEFINED BY CI AND CL */ 

/* USING THE RLF ALGORITHM. »/ 

/* */ 

/* ON INPUT: */ 

/* */ 

/* N IS THE NUMBER OF NODES IN THE GRAPH. */ 

/* CI IS THE INDEX VECTOR FOR CL. */ 

/* CL IS THE NODE ADJACENCY LIST FOR THE INPUT GRAPH. FOR */ 

/* EXAMPLE, CLCCICI-1 > + I ) , CL( C I < I - 1 ) +2 ) , ..., CLCCICD) */ 

/* CONTAIN, IN SEQUENCE, THE C I C I ) -CI ( I - 1 ) NODES ADJACENT */ 

/* TO NODE I. */ 

/* COL, C, E AND F ARE MEANINGLESS. */ 

/* */ 

/* ON RETURN: */ 

/* */ 

/• N, CI AND CL ARE UNCHANGED. •/ 

/* COL IS THE NUMBER OF COLORS USED TO COLOR THE GRAPH. */ 
/« C IS THE COLOR FUNCTION ASSIGNED TO THE GRAPH. FOR EXAMPLE, */ 

/* C(I) IS THE COLOR ASSIGNED TO NODE I. */ 

/* E AND F ARE MEANINGLESS. */ 

/* */ 
DECLARE CN,CI(*) ,COL,CC*> ,E<*) ,FC*> ,J,L, I ,K) BINARY FIXED (31), 
CLC*) BINARY FIXED (15); 

/* */ 

/* INITIALIZE THE COLOR FUNCTION TO ZERO. »/ 

/* »/ 

C = ; 

COL=0 ; 

J = ; 

L=l; 

/* */ 

/* INITIALIZE THE F VECTOR TO THE NODE DEGREES. */ 

/* ♦/ 
DO 1=1 TO Nj 

FC I)=CICI)-CIC 1-1 ) ; 

END; 

/♦ */ 

/* IF THERE ARE ANY UNCOLORED NODES, INITIATE THE ASSIGNMENT OF •/ 

/* THE NEXT COLOR. */ 

/* »/ 
DO WHILE CJ<N); 

COL=COL+i ; 

/* */ 

/* REINITIALIZE THE E VECTOR. */ 

/* »/ 

DO 1=1 TO N; 
EC I)=FC I) ; 

END; 

/* */ 

/* SELECT THE NODE IN Ul WITH MAXIMAL DEGREE IN Ul. */ 

/# #/ 

DO 1=1 TO N| 

IF FC I )>FCL) THEN L=I 5 
END; 

/* */ 

/* COLOR THE NODE JUST SELECTED AND CONTINUE TO COLOR NODES WITH */ 
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/* COL UNTIL Ul IS EMPTY. */ 

/* ■■:/ 

DO WHILE C£CL)>=0) } 

/* */ 

/* COLOR NODE AND MODIFY Ul AND U<d ACCORDINGLY. */ 

/* */ 

call delete ce,l) ; 
call delete (f,l) 5 
ccd=col; 
j=j+i ; 

IF CI CL)>CI CL-1 ) THEN DO I=CI(L-1)+1 TO CI(L); 
IF ECCLCI))>=G THEN CALL DELETE CE,CL(I)); 

END; 
/a */ 

/** FIND THE FIRST NODE IN ul, IF ANY. */ 

/* #/ 

K=0 ; 

DO 1=1 TO N WHILE (K=G) 5 
IF E C I ) >=G T REN K= I ; 

END ; 
/* */ 

/* IF Ul IS NOT EMPTY, SELECT THE NEXT NODE FOR COLORING. */ 

/* »/ 

IF K>0 THEN DO; 

l=k; 

DO I=X TO I\); 

IF EC I )> = THEN DO; 

IF FC I )-E( I )>FCL)-ECL) THEN L= I ; 

ELSE IF FC I )-E< I )=FCL)-ECL) & ECI)<E:CL) THEN L=I; 
END; 
END; 
END; 
END; 
END; 
END RLF ; 

/* */ 

/« */ 

/* */ 

DELETE: PROCEDURE CH,M); 

/* */ 

/* THIS SUBROUTINE DECREMENTS THE VALUE OF H FOR M AND NODES */ 

/* ADJACENT TO M. */ 

/* */ 

/* ON INPUT: */ 

/* */ 

/« H IS EITHER THE E OR F VECTOR FOR THE GRAPH. V 

/* M IS THE NODE TO BE DELETED. */ 

/* */ 

/* ON RETURN: */ 

/* */ 

/* H IS APPROPRIATELY UPDATED. */ 

/* M IS UNCHANGED. V 

/» */ 

DECLARE CH(»),M,P) BINARY FIXED Col); 

H c x ) = - 1 ; 

IF CICM)>CI C: - i ) THEN DO ?=GI(:-1)+1 TO CICX); 
HCCLCP) )«H(CLCP) )- 1 ; 

END; 

END DELETE; 
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12. Appendix C: Characterization of Test Graphs 

The data provided in tables 4 and 5 of this appendix provide the necessary information to regenerate each 
of the test graphs used in the preparation of the data included in tables 1 and 2, respectively. Each graph is 
referenced by its chromatic number, average node degree, and order in which the colorings (or values of X ) 
are given. For example, the parameters for the third 150-node graph with chromatic number 5 and average 
node degree 11 (i.e., the graph that SL 8-colored) are: n = 150, k = 5, a = 8401, c = 6859, m = 84035, 
& 5 = 19^ \ H = 60, 6 3 = 97, b 2 = 210, and X = 22093. Using these parameters, it is possible to regenerate 
the graph by using the procedure described in section 6. 











Table 4. 


Generation 


Parameters for the Test Graph Used in Table I. 




X 


d 


n 


k 


a 


c 


m 


b 


3 values of X 


5 


11 


150 


5 


8401 


6859 


84035 


(19, 60, 97, 210) 


0,33289,22093 




19 


150 


5 


8401 


6859 


84035 


(39, 120, 195, 420) 


21047,55697,74912 




24 


150 


5 


8401 


6859 


84035 


(58, 180, 292, 630) 


78692,83491,52870 


10 


11 


150 


10 


8401 


6859 


168070 


(10, 0, 0, 12, 0, 0, 25, 0, 2, 4) 


8879,64827,78005 




21 


150 


10 


8401 


6859 


168070 


(21, 0, 0, 24, 0, 0, 51, 0, 48) 


5293,59845,102567 




29 


150 


10 


8401 


6859 


168070 


(31, 0, 0, 36, 0, 0, 76, 0, 72) 


107489,118239,101759 


15 


12 


150 


15 


8401 


6859 


252105 


(4, 0, 0, 0, 0, 7, 0, 0, 0, 0, 12, 0, 0, 148) 


80589,60363,94632 




25 


150 


15 


8401 


6859 


252105 


(9, 0, 0, 0, 0, 15, 0, 0, 0, 0, 24, 0, 0, 297) 


220881,67107,198723 




34 


150 


15 


8401 


6859 


252105 


(13, 0, 0, 0, 0, 22, 0, 0, 0, 0, 36, 0, 0, 445) 


66684,189309,9534 



Table 5. Generation Parameters for the Test Graphs Used in Table 2. 



X 


d avg. 


n 


k 


a 


c 


m 


b 


2 values of X 


5 


25 


450 


5 


8401 


6859 


84035 


(175, 540, 877, 1890) 


0,41794 




43 


450 


5 


8401 


6859 


84035 


(409, 1260, 2047, 4410) 


35428,47927 


15 


36 


450 


15 


8401 


6859 


252105 


(40, 0, 0, 0, 0, 67, 0, 0, 0, 0, 108, 0, 0, 1336) 


36276,213549 




74 


450 


15 


8401 


6859 


252105 


(94, 0, 0, 0, 0, 157. 0, 0, 0, 0, 252, 0, 0, 3118) 


161712,160056 


25 


37 


450 


25 


8401 


6859 


420175 


(13, 0, 0, 0, 0, 0, 0, 0, 0, 27, 0, 0, 0, 0, 0, 
0, 0, 0, 40, 0, 0, 0, 0, 1336) 


192625,358531 




77 


450 


25 


8401 


6859 


420175 


(31, 0, 0, 0, 0, 0, 0, 0, 0, 63, 0, 0, 0, 0, 0, 
0,0, 0,94,0,0,0, 0, 3118) 


247337,274955 
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